None

On the Smartphone scene, iOS and Android are the two main characters on stage occasionally, Windows and BlackBerry walk-on, Tizen and system like Firefox, you just wait in the back playing little lady, Firefox Mobile’s recent moves are in Japan released the LG Fx0 transparent fuselage, but machines are not so transparent.

Japan local of three big telecommunications operators NTT Docomo, and KDDI au and SoftBank Zhijian of war has never on no stop had, they their except Android intelligent phone zhiwai also has efforts to expand other intelligent system of phone, Qian paragraph time Docomo also outflow has a Taiwan has canceled listed plans of prototype machine, run Tizen system, and au select of way is Firefox, and and Docomo different of is, au let this paragraph Firefox phone listed has.

Fx0 ultimately also is a paragraph Japan of custom machine, and AU also no for this paragraph phone launched official solutions lock, that is in XDA Shang of God of clutches reached it zhiqian, Fx0 forever are not can using any of non-au SIM card, and more further for, AU in Japan operation of is CDMA network, that is this Taiwan phone even in future can solutions lock, in domestic also only using didn’t human rights of telecommunications of phone card for calls.

Anyway, to the point.

When the AU was first published when this phone, online voice of surprising and not much, because AU in Japan’s three major carriers top long bottom, and in order to realize the overtaking, au also shrugged off the active introduction of the other two major operators of machines, such as the LG g-Flex, HTC One M7, HTC butterflies, as well as several to Motorola mobile phones. Au LOGO changed again, several months ago, the AU also opened a huge flagship store in Shinjuku, Tokyo, plus au has introduced a variety of “exotic” model of traditional Fx0 released in au long brush history look not too unexpected places.

Fx0 first impressions not so simple: the transparent fuselage design does seem a bit mean, but also makes the whole cell phone full of tacky and cheap, as au introduced mobile phone designer tokujin yoshioka feels the same to me–because this guy I’ve never heard of, his name now appears, he is really a mobile phone Designer?

Look on the one hand, Fx0 running nothing Firefox operating system popularity, at least in Japan little popularity, and au tacitly, in only a few retail stores (including of course Shinjuku flagship store) started selling this phone.

In into retail stores then with clerk description Jamboree zhihou, clerk immediately on like shock General to open has chatterbox, efforts to persuaded I don’t buy this paragraph phone’s, this paragraph phone just a paragraph “concept sex” of products’s, buy this paragraph phone consequences conceit’s BLABLABLA, until I showed that “I buy this paragraph phone is to ‘ development ‘ like of uses”, clerk only elimination stopped to let I quiet to buy buy buy.

This phone package with “delicate” two words cannot be overemphasized, LG (or au? ) For the pursuit of details has a very high, and packaging and the outside is a plain white box, taking the gold from the white boxes within boxes, original a transparent phone consumers, with a lattice pattern of rear cover, au also comes with a fully transparent back cover no pattern. If you want to buy a protected borders, thickening of AU will offer a golden border, need additional 6588 yen (347).

On personal seems, this paragraph phone of appearance also is good of, transparent of appearance let buyers temporarily forget has this paragraph price reached 500 dollars (3100 Yuan) of phone of specifications just is in the end level: 4.7 inches 720p IPS screen, and Snapdragon 400 processor, and 2370 Ma Shi battery, and 16GB built-in storage, and 1.5GB RAM, and 8 million pixel Hou reset camera and 2.1 million pixel Qian reset camera, Body can use MicroSD cards to expand the memory, maximum support 64GB, which is the current Firefox mobile “flagship” configuration, at least is “flagship”.

Is fair, Firefox phone system does not so bad, this system itself very clean, and somewhat Android and iOS hybrid of taste, compared Yu it of competition opponents–including Windows Phone 8.1, Firefox system is enough mature, Fx0 run of has is Firefox OS2.0 system has, and at least it support copy sticky posted and shear function, but this system after all also is enough developed, occasionally BUG also many.

Xia pulled notification bar is we is familiar of has, Firefox system Xia pulled notification bar zhihou, all of system switch are is located in notification bar bottom, this which including flight mode, and GPS, and Bluetooth,, this also makes control these button than TouchWiz this class of UI to comfortable have more, Firefox OS internal also has flow counter, each pulled Xia notification bar zhihou are is can saw of.

IOS style of home page is thick, neatly arranged on a circle icon on the main interface, press and drag to reorder, similar to the Android home page at the top and not close the search bar.

Unfortunately, as a new system, even after you update to version 2.0, and Firefox OS is still problematic:

1, users cannot edit the e-mail drafts are not stored on the device, which is used for Android users is simply too uncomfortable.

2, for operation of the entire system is too sensitive, especially when surfing the Web, in order to prevent phone sleep I slightly touched the screen, then I’ll know I need to capture a screen shot.

iPhone 5 case Givenchy

3, the typing experience is pain.

4, and shopping had Firefox OS of application store zhihou again back see Windows Phone of application store, on will think which on like a this greatly of wiki Wikipedia, and even has APP, also not perfect, took Line (even I) this application for’s, Firefox OS Shang of Line application basic cannot more crude has, this software of Firefox version even no provides VoIP call, and time line,, phone of entered forecast function also let I has hit phone of impulse, in addition, There is no software is worth talking about, remaining there as very early Android applications market, “pirated software”, and even software, still in the Windows Phone and Fire widely exist in the OS market. iPhone 5 case Givenchy

In addition, Firefox version Facebook client cannot use because the phone does not return or menu buttons, so even want to return to the previous page, a simple operation, and opens a small pop-up menu.

Certainly has, System press too less not Firefox OS unique of problem, annual spit slot iPhone press number too less of people has never not minority, iPhone and WP for this spit slot of solution attitude is joined has edge sliding gestures, so currently many people habits among so a sliding is returns last interface of, and Firefox is not as, on like zhiqian lift of Facebook client for, I think Firefox for this also is have good lane a lane, Don’t get return this simple operation will open the menu just a little. iPhone 5 case Givenchy

In the end, I couldn’t help thinking about a very serious question: who the hell is this mobile phone design?

Price close to 3100 Yuan for a mobile phone, it is expected that the phone is good value for money, while the Fx0 this phone is far from its pricing, same configuration of Android phones, including LG, of course, prices are much lower, but the phone problems, mostly on the system level.

So Fx0 undoubtedly on into has a paragraph developers with machine, after all General of consumers not is wants to not pass spent this price buy it (I this Taiwan signed has two years of contracts), even is for intelligent phone lovers for, this paragraph phone application lack, function limited, Japan currently sale of things Basic are not provides returned goods, especially electronic products, so basically, you buy Xia has it, it is you of has, certainly you can found a willing to took over of home on matter.

Develop deep enough, Firefox system in the present do not pose any threat to the Android and iOS, even the Windows Phone and BlackBerry OS are threatened not to, because in personal opinion, Firefox system the current system is less efficient than the previous system, Firefox improve the road is still very long.

Firefox OS starting point was strangely good, but lack of hardware and developer support has made this intelligent system lurch, as the same fledgling Tizen intelligent systems, supported by the Samsung might be able to develop more easily than Firefox.

The original AndroidAuthority 

Some pictures from Japanese.Engadget.com

None

Apple Android 75% iOS9 installation rate 6.0 is still marking time

According to tech blog VentureBeat reported, Apple updated data on Tuesday showed, iOS9 rate finally reach 75%. In early December last year, iOS9 installation rate of just 70%, increased by 5% in about a month, come as a surprise. In General, currently amounts to an average of 4 iOS devices have three cell phones running Apple’s latest operating system. Marcelo Burlon iPhone 6 Case

In January 12, Apple developers push the latest iOS9.3 first beta, iOS9 accelerated its installation rate is also high.

According to statistics, iOS9 coverage 75% with less than 4 months from the time, while the previous generation iOS8 time this took 6 months to complete, iOS9 has now become the Apple iOS systems in past dynasties has spread fastest in a generation. And now, iOS8 current installation rate has fallen to 19%, other earlier systems share only single digits: 7%.

But on the other hand, as of January of this year, latest Android in the Android camp 6.0 system penetration reached only 0.7%, while both market volume in different environments, but at the latest on the speed of the system, Android does need to be strengthened.

Apple Android 75% iOS9 installation rate 6.0 is still marking time

According to foreign media reports, Google CEO Pi Chayi (Sundar Pichai) on Tuesday announced that the company’s annual I/O developers ‘ Conference scheduled to be held from May 18 to 20th this year, and location is located in mountain view, Shoreline Ampitheatre Amphitheater. Marcelo Burlon iPhone 6 cases

In General, Google is usually at the I/O Conference released its latest version of the Android operating system, this is likely to be Android 7 operating system.

Marcelo Burlon iPhone 6 Case

Apple Android 75% iOS9 installation rate 6.0 is still marking time

Tips

None
None

Micro sweep sweep, author tips bar ~

Round Home key fast charge Hammer T3 Samsung showed two new smart watch Lei

Round Home key + fast charge: Hammer T3?

Round Home key + fast charge: Hammer T3? Samsung showed two new smart watch | Lei feng's morning news

According to the exposure of message, hammer, T3, after 10 months of research and development, will officially debut in September 2016. Look, hammer T3 features a 5.2-inch display at a resolution of 1080P mobile phones Japan JDI screen. Hammer T3 front round Home key design, estimated hammer on the T3 sensor should be provided by FPC.

Round Home key + fast charge: Hammer T3? Samsung showed two new smart watch | Lei feng's morning news

Configuration, T3 was carrying a hammer mycophenolate mofetil 820 processor, 4GB RAM+64GB ROM. Post-16 million pixels camera, supports features such as laser focusing and optical image stabilization. 3200mAh of battery capacity that enables fast charging function. It is worth mentioning that, the Netizen said hammer in a 5.7-inch screen mobile phone, but this should be the nut mobile second generation.

Samsung showed two new smart watches use does not need to connect the phone

Moschino Galaxy S4

Round Home key + fast charge: Hammer T3? Samsung showed two new smart watch | Lei feng's morning news

On September 1, the Samsung on Wednesday showed off the company’s two new smart watch, one called “Gear S3 Frontier” and another is named “Gear S3 Classic.” Samsung configuration of these smart watches a number of digital features and battery life can be as long as 4 days. In Samsung’s opinion, these smart watches will trump Apple Apple Watch.

Samsung’s two new Gear S3 smart watches, comes as Europe’s largest consumer electronics and home appliances trade fair IFA in Berlin this week ahead of the opening.

Gear S3 Frontier watch designed with rugged outdoor and Gear S3 Classic configure a more refined design, however, these smart watches greater interface is configured, this may be an attempt to appeal to male consumers.

Apple patent for flexible OLED display

Round Home key + fast charge: Hammer T3? Samsung showed two new smart watch | Lei feng's morning news

At present, the United States Patent and Trademark Office, Apple adopted a new patent related to bending design of mobile phone. Patent describes a hollow display cover structure, you can use the Sapphire Crystal, or other transparent material. This structure is based on flexible OLED displays, this allows the iPhone to avoid similar “bent”. Meanwhile, the structure can even use liquid metal, in the face of violent collisions can avoid damage. Moschino Galaxy S4 Cases

Of course, whether the patent application the iPhone8 is doubtful, we have heard of the iPhone 6 “bent” event, put it in your pocket in the deformation occurs when a device is squeezed. And now these flexible devices can prevent the occurrence of such accidents.

Machine learning unit was established following the acquisition of Turi Apple

Round Home key + fast charge: Hammer T3? Samsung showed two new smart watch | Lei feng's morning news

Earlier this month with the $ 200 million acquisition of machine learning and artificial intelligence startups after the Turi, Apple is being converted into a special machine learning Department.

It is understood that Apple has now begun to Turi team search for APP developers and data scientists, aims to turn it into a new machine learning Department. Turi will remain in the sector located in Seattle.

Machine learning Department of the new Apple product teams to develop new features, and application to Apple’s APP and future products. Before its acquisition by Apple, Turi GraphLabCreate, Turi machine learning platform was launched on the market, and TuriDistributed and TuriPredictive Services–can be used to develop recommendation engines, sentiment analysis, fraud detection and other software solutions. For Apple, which will significantly enhance the Siri voice Assistant, and a variety of services such as Apple App Store and Music.

BMW shows remote 3D imaging system for new 5 series Moschino Galaxy S4

Round Home key + fast charge: Hammer T3? Samsung showed two new smart watch | Lei feng's morning news

According to the United States media leftlanenews reported on August 29, BMW in its social media channels, the company released its remote 3D imaging systems technology (Remote 3D View) videos, BMW’s new 5-series also appeared. It is reported that the remote 3D imaging systems technology will be applied to the new BMW 5 series models, and through mobile applications, after coming to a stop, and 5 owners with real-time images around the car and the car.

BMW explained that they provide the technology to enable owners to the vehicle is rub, collision, intrusion or theft to obtain timely notification, and owners can use the shooting suspicious behavior of your application.

Musk: Tesla will be released within a few weeks of Autopilot upgrade

According to Reuters, Tesla’s CEO Yilong·masike (Elon Musk) on Twitter on Wednesday said that Tesla plans “within weeks” Autopilot upgrade released its semi-automatic driving system, data processing method to improve radar systems. New Autopilot system will improve the ability of radar signal processing, better ambient light, color changing weather conditions or failed to detect certain previously hard to find objects.

5G is not far off: Ericsson will start delivery in 2017, 5G components

Foreign media reported on 31st, Ericsson said on Tuesday it will begin in 2017 and delivered all the components needed to launch 5G mobile communication network, intergovernmental body set up on this new equipment frequencies and standards agreed 2020, three years ahead of the deadline. This Sweden said it has launched 5G with 26 partnership of telecom operators.

The vase has learned to GE you lie


20160924-vase-6

You must accept the vase, doesn’t it? You might even think it’s pretty amazing … … Moschino retina iPad Air case

So, as long as the good-looking, does “GE you lie” is also not bad, watch their faces there after all.

===========

From the Stockholm Studio Studio E.O creative,

Compared with traditional vases, such as pine, sitting like a Bell “Clank”

This vase wasn’t a “qualitative”,

Melting glass, is placed in the marble and other kinds of rigid solids cooling,

And the vase is no longer independent of ornaments,

Instead of decorations for a group, it shows a different kind of charm.

20160924-vase-1
20160924-vase-2
20160924-vase-3
20160924-vase-4
20160924-vase-5
20160924-vase-7
20160924-vase-8
20160924-vase-9

Designer says

They wanted this group to explore the relationship between space and object interaction the vase, Moschino iPad air cases

Moschino retina iPad Air case

-The words is that you like it or not like?

[via]

Don t blush said would like to introduce today is the history of pose most girls

Don't blush said, would like to introduce today, is the history of pose most girls
20160827-mdl-1

Well, going to the Spring Festival, and suggests it could be some what positions will be girl. Givenchy iPhone 6 plus cover

Givenchy iPhone 6 Plus Case

From Greece designer Lefteris Tsampikakis’s idea.

MDL is a girl. Givenchy iPhone 6 Plus Case

Its beautiful flowers,

Body hot

Best of all, it almost completed the development of all postures,

After you get, just comfortable.

20160827-mdl-2
20160827-mdl-3
20160827-mdl-4

Yes, magnetic base on white,

Is wood textured table lamp.

Without structure,

But you can move anywhere on the lamp base,

Magnetic will firmly hold it,

In order to achieve any angle and position switch.

20160827-mdl-5
20160827-mdl-6
20160827-mdl-7
20160827-mdl-8
20160827-mdl-9
20160827-mdl-10

This design, in 2016 the winners of the red dot design award.

[via]

Detailed hotspots endpoint detection of speech detection noise reduction and

As a means of human-computer interaction, speech endpoint detection in the liberation of human hands is significant. Meanwhile, work environment there are all sorts of background noise, the noise will seriously degrade audio quality, which affect the result of voice applications, such as reduced rate. Uncompressed audio data, interactive application network traffic flow in the network, thereby reducing the success rate of speech applications. Therefore, audio detection, noise Terminal speech processing and audio compression is always the focus of is still active research topic.

To be able to work with you to understand basic principles of endpoint detection and noise reduction, take you with a glimpse into the mysteries of audio compression, this hard to create public class guest HKUST flew senior engineer Li Hongliang, will bring us a keynote: details of voice detection technology hot spot–endpoint detection, noise reduction and compression.

Detailed hotspots--endpoint detection of speech detection, noise reduction and compression | Hard to create open class

Guest introduction

Li Hongliang, graduated from the Chinese University of science and technology. HKUST flew senior research engineer, engaged in the speech engine and voice cloud computing development, HKUST flying voice one of the founders of cloud, leading research and development for flying voice speech codec libraries on the cloud platform, we used more than 2 billion. Construction of leading voice of the national standards system, leading, participate in more than one voice formulation of such national standards. He shares today will be divided into two parts, the first part is the endpoint detection and noise reduction, the second part is the audio compression.

▎ Endpoint detection

First look at endpoint detection (Voice Activity Detection, VAD). Audio endpoint detection from continuous speech stream detection and effective speech. It consists of two parts, detected the starting point for effective voice front end point, detected a valid end point after the end of the speech.

Speech endpoint detection in the voice application is necessary, first of all, very simple, voice scenario is in storage or transmission, effectively isolated from continuous speech stream voice, you can reduce the amount of data being stored or transmitted. Secondly, in some scenarios, using the endpoint detection can simplify human-computer interaction, such as in the recording scene, speech endpoint detection after ending recording operation can be omitted.

Detailed hotspots--endpoint detection of speech detection, noise reduction and compression | Hard to create open class

In order to more clearly define the endpoint detection principle, first to analyze an audio. Pictured above is a simple audio only two words, the picture can be seen very intuitive, end the silence acoustic amplitude is very small, but effective speech part of the amplitude is large, the amplitude of a signal a visual indication that the size of the signal energy: silent energy value is smaller, effective voice parts of the energy value is bigger. Speech signals is a one-dimensional continuous function of time as the independent variable, the computer processing of voice data is voice signal chronological sequence sampling value, the size of these samples also said the voice signal at the sampling point of energy.

Detailed hotspots--endpoint detection of speech detection, noise reduction and compression | Hard to create open class

Sampling value in the has comes as and negative, calculation energy value Shi not need consider plus or minus,, from this meaning Shang see, using sampling value of absolute to said energy value is naturally of idea, due to absolute symbol in mathematics processing Shang not convenient, so sampling points of energy value usually using sampling value of square, a contains n a sampling points of voice of energy value can defined for which the sampling value of square and.

In this way, a voice of energy associated with the where sample size, and contains the number of sampling points. In order to investigate the variation of the sound energy, needs to be speech signal segmentation according to a fixed length such as 20 milliseconds, each split units called frames, each frame contains the same number of points, then the voice of energy per frame value.

If front of the audio part of the M0 frame energy values in a row below the energy threshold in advance of E0, the next row M0 frame energy values greater than E0, then in front of the voice is voice of the energy value. Similarly, if the consecutive frames of voice energy values is large, and subsequent smaller frame energy value and last for a certain length of time, you can think that reduced energy value that is a voice endpoint.

The question now is, threshold energy value E0 ready? M0 is how much? Ideal quiet energy value is 0, E0, ideally in the algorithm above is 0. Unfortunately, gathering audio scenarios tend to have a certain intensity of background noise, this pure background sound is muted, but its energy value is not 0, so the collected audio background usually have certain underlying energy values.

We always assume that collected the audio at the beginning a little mute, typically hundreds of milliseconds in length, this little mute is the basis we estimate threshold E0. Yes, always assume that the audio at the start of a short speech was muted, this assumption is very important!!!! In subsequent noise reduction is also used in the introduction to this assumption. In estimating the E0, selected a certain number of frames such as 100 frames before voice data (these are “silent”), calculate the average energy value, then add a value experience or multiplied by a factor greater than 1, and E0. The E0 is our benchmark for judging a frame whether the voice is muted, is larger than this is the effective voice, is less than this value is muted.

As for M0, easier to understand, the magnitude of which determines the endpoint detection sensitivity, M0 is smaller, higher endpoint detection sensitivity, whereas the lower. Apply for different endpoint detection sensitivity should be set to a different value. For example, in the application of voice-activated remote control, because the voice instructions are generally simple control instructions, such as comma or period Middle long stalled is highly unlikely, it is reasonable to increase the sensitivity of detection, M0 is set to a smaller value, the corresponding audio is usually around 200-400 Ms. Voice dictation applications, such as comma or period because there will be pauses for a long time would be preferable to reduce detection sensitivity, at which point the M0 value to a large value, the corresponding audio is usually 1500-3000 in milliseconds. Values of M0, which endpoint detection sensitivity, should in practice be made adjustable, its value according to the voice application scenarios to choose from.

More than just voice activity detection is simply the General principles, practical applications of the algorithm is far more complicated than the above. As a widely used voice processing technology, audio endpoint detection remains an active research. HKUST fly have been using recurrent neural networks (Recurrent Neural Networks, RNN) technology for voice activity detection, the practical effect to concerns fly products. Disney iPhone 6 Case

▎ Noise

Drop noise and said noise inhibit (Noise Reduction), Qian paper mentioned, actual collection to of audio usually will has must strength of background sound, these background sound General is background noise, dang background noise strength larger Shi, will on voice application of effect produced obviously of effect, like voice recognition rate reduced, endpoint detection sensitivity declined,, so, in voice of front-end processing in the, for noise inhibit is is has necessary of.

There are many kinds of noise, white noise spectrum stability and instability of fluctuation noise and impulse noise, speech applications, the steady background noise is the most common, most mature technology, the effect is best. This course discusses the steady white noise, which always assumes that the spectrum of the background noise is steady or quasi-steady.

Earlier voice activity detection is carried out in the time domain, the noise reduction process is carried out in the frequency domain, to this end, we first introduced or review for an important tool for conversion between time domain and frequency domain – Fourier transform.

In order to make it easier to understand, look at Fourier learned advanced mathematics, advanced mathematical theory suggests that a periodic 2T function satisfying Dirichlet conditions f (t), you can expand into a Fourier series:

Detailed hotspots--endpoint detection of speech detection, noise reduction and compression | Hard to create open class
Detailed hotspots--endpoint detection of speech detection, noise reduction and compression | Hard to create open class

For General continuous time domain signal f (t), for its domain [0,T], its odd after the extension, the Fourier series is as follows:

Detailed hotspots--endpoint detection of speech detection, noise reduction and compression | Hard to create open class

BN calculation as above, the above shows that any continuous time domain signal f (t), can be represented by a set of linear superposition of trigonometric functions. Or, f (t) can be formed by a Delta function is a linear combination of the infinite sequence of approximations. Signal is a signal of the Fourier series shows the frequency and amplitude of frequencies, therefore, right side of the equation can be seen as a signal f (t) spectrum, put it more starkly, signal spectrum refers to the signal which frequency components and the amplitude of each frequency. From left to right on process is a process of seeking known signal spectrum, from right to left is a signal of the process of reconstruction of the spectrum of the signal.

While signal the Fourier spectrum concept is easy to understand, but in practice to obtain the spectrum of the signal, using a generalized form of Fourier series–Fourier transform.

Fourier transform is a big family, in different fields of application, different forms, here we only give out two forms–continuous Fourier transform and discrete-time Fourier transform:

Detailed hotspots--endpoint detection of speech detection, noise reduction and compression | Hard to create open class

Where j is the imaginary unit, which is j*j=-1, which corresponds to the inverse Fourier transform are:

Detailed hotspots--endpoint detection of speech detection, noise reduction and compression | Hard to create open class

In practical applications, Fourier transform digital signal, the frequency spectrum of the signal can be. Frequency domain processing is completed, you can use the inverse Fourier transform converts the signals in the frequency domain to time domain. Yes, Fourier transform is a complete an important tool for from the time domain to the frequency domain transformations, a signal the Fourier transform, frequency spectrum can be obtained.

Above is the Fourier transform of a brief introduction, mathematical knowledge is not good friends can not read does not matter, just understand that Fourier transform of a time domain signal, you can get the signal spectrum, that is, complete following conversion:

Detailed hotspots--endpoint detection of speech detection, noise reduction and compression | Hard to create open class

Is to the left of the time domain signal corresponds to the right of the spectrum time domain signal is generally concerned about what time what value is the frequency and amplitude of frequency domain signal concern.

With these theories as a basis for understanding principles of noise reduction is much easier, noise reduction is the key to extracting a noise spectrum, noisy speech based on the noise spectrum is then do a reverse compensation operations, resulting in noise reduction of speech. This is important, what is behind is built around these words.

Noise suppressing flow as shown in the following figure:

Detailed hotspots--endpoint detection of speech detection, noise reduction and compression | Hard to create open class

Endpoint detection of similar, assuming that the audio at the beginning a little voice was background noise, this assumption is very important, because this small piece of background and background noise, is the basis of extraction of noise spectrum.

Noise reduction process: first, a little background is divided into frames, and are grouped according to the sequence of frames, each frame can be 10, or other values, the number of groups of not less than 5, then each set of background noise in the data frame using the Fourier transform of the spectrum, then the spectrum averaging background noise spectrum.

Get noise of spectrum Hou, drop noise of process on very simple has, Shang figure following left of figure in the red part that for noise of spectrum, black of line for effective voice signal of spectrum, both common constitute containing noise voice of spectrum, with containing noise voice of spectrum minus noise spectrum Hou get drop noise Hou voice of spectrum, again using FT leaves inverse transform turned returned to Shi domain in the, to get drop noise Hou of voice data.

The figure below shows the noise reduction effect

Detailed hotspots--endpoint detection of speech detection, noise reduction and compression | Hard to create open class

Left picture is the comparison of before and after noise reduction in the time domain, is to the left of noisy speech signal, you can see from the picture noise is very apparent. Noise reduction of speech signals is to the right of, as can be seen, the background noise has been inhibited.

The following comparison of the two images is in the frequency domain

Detailed hotspots--endpoint detection of speech detection, noise reduction and compression | Hard to create open class

Scissa indicates time axis, the vertical axis represents the frequency, is to the left of noisy speech, one of the bright red part is the effective voice, and those purple part is noise like sand. As can be seen from the diagram, the noise is not only “the ever-present”, but also “everywhere”, which are distributed at the various frequencies, noise reduction of speech is on the right side, you can clearly see, before the noise part of purple light a lot like sand, is the effective suppression of noise.

In practical applications, the noise reduction is a variation of the noise spectrum is often used, but with the noise reduction process be amended, which is adaptive to the process of noise reduction. This one is mute on the front length of voice data sometimes isn’t long enough, data isn’t enough background noise to get the noise spectrum is often not accurate enough, on the other hand, the background noise is not absolutely stable, but gradients and even mutate to a steady background noise.

Disney case

These reasons are required during the process of noise reduction using noise correction in a timely manner to get better noise reduction effect. Correction of noise spectrum method is to use the secondary audio mute, repeat the extraction algorithm of noise spectrum, new noise spectrum is obtained and used for correction of noise the noise spectrum, so to use endpoint detection in the noise reduction process used in how to judge the quiet. Noise spectrum method or the old and new spectrum is a weighted average, or use new noise spectrum to fully replace the use of noise spectrum.

Described above is a very simple principle of noise reduction. Practical applications of noise reduction algorithm is more complex than described above, real variety of noise sources, its mechanisms and features are more complex, so noise reduction is still an active area of research today, new technologies are emerging one after another, such as in practical applications have been using microphone array for noise suppression.

▎ Audio compression

The need for audio compression is well known, not repeat them. All audio compression systems are required to have two corresponding algorithms, a coding algorithm is run on the source side (encoding), the other is running at the receiving or decoding algorithm for user terminal (decoding).

Encoding and decoding algorithms show some asymmetry. This asymmetry is reflected in the coding and decoding efficiency can be different. Audio or video data when it is stored, usually be encoded only once, but thousands of times will be decoded, so encoding algorithm is more complex and less efficient, costs can be accepted, but the decoding algorithm must be fast, simple, and cheap. Coding algorithm and decoding algorithm of not symmetric sex also performance in coding and decoding of process usually is not inverse of, that is, decoding Hou get of data and coding zhiqian of original data can is different of, as long as they listening to up or looks is as of can, this series decoding algorithm usually called lossy of, and this corresponds to of is, if decoding Hou get and original data consistent of data, this coding and decoding called lossless of.

Audio and video encoding and decoding algorithms are lossy, because some small amount of loss of information can often be changed to compression ratio increased, coding of audio signals using a data encoding some technologies, such as waveform entropy coding, coding, parameters, coding, coding, and perceptual coding, etc.

This lecture focuses on perceptual coding, coding algorithms relative to the other, perceptual coding based on the characteristics of the human auditory (acoustic) to remove redundancy in audio signals, so as to achieve the purpose of audio compression. Compared to other audio coding algorithms (lossless), ears don’t feel obvious distortion of the conditions, can reach more than 10 times greater compression ratios.

First to introduce the psychoacoustic basis of perceptual coding. Audio compression core is to remove redundancy. So-called redundancy is in speech signal contains information that cannot be perceived by the ear, which humans determine the timbre, pitch and other information without any help, for example, the human ear can hear frequencies in the range of 20-20KHz, unable to perceive frequencies below 20Hz and infrasound frequencies higher than 20KHz ultrasound. For example, the human ear cannot hear a “not” sound. Perceptual coding is the use of such features of the human hearing system, achieve the purpose of removing redundant audio information.

Psychoacoustics in perceptual coding are: frequency masking, temporal masking, audibility threshold.

Detailed hotspots--endpoint detection of speech detection, noise reduction and compression | Hard to create open class

Frequency shield frequency shield in life in the everywhere visible, like you home in the sat in sofa Shang quiet of see TV, suddenly, is decoration of neighbors home a is harsh of drill drill wall of voice came, then you by can heard of only mobile drill issued of is strong of noise, despite at TV by issued of voice still in stimulus with you of eardrums, but you is turned a deaf ear to, that is, a strength is high of voice can completely shield a strength lower of voice, this phenomenon called frequency shield.

Detailed hotspots--endpoint detection of speech detection, noise reduction and compression | Hard to create open class

Temporal shielding to take the previous example, both in drill sound the human ear can not hear the sound of the TV, is the sound of drills just stopped for a short period of time, the human ear can’t hear TV sound, this phenomenon is known as time-domain masking. Temporal shield due to the human auditory system is a system with adjustable gain, when you listen to the sound intensity, lower gain, less noise, higher gains. Sometimes even through external means to change the gain of the auditory system, for example, covered his ears to avoid very much noise damages eardrums, while holding your breath, ear, hand behind ear is listening to common behaviors of the weaker voices. In the above example, strength a lot just disappeared when the auditory system requires a short period of time to increase the gain, it is in this short period of time domain masking.

Audibility threshold below which for audio compression is very important.

Conceived in a quiet room, a speaker can issue a frequency controlled by the computer’s voice, at first speaker less power, at a distance of hearing people hear speakers voice. And then began to gradually increase the speaker’s power, as power increases to the right can be heard when recording speaker power (sound intensity level, DB), this power is the frequency audibility threshold. Disney case

Then change the speaker audio frequencies, repeat the experiment, eventually obtaining the audibility threshold versus frequency curve shown in the following figure:

Detailed hotspots--endpoint detection of speech detection, noise reduction and compression | Hard to create open class

The diagram can be seen clearly, the human hearing system is most sensitive to frequencies in the range 1000-5000Hz voice frequency closer to the sides, human hearing is slow to respond.

Go back and look at frequency shielding case, this experiment in the room to add a frequency 150Hz, 60dB signal strength, and then repeat the experiment, experimental results of audibility threshold curve shown in the following figure:

Detailed hotspots--endpoint detection of speech detection, noise reduction and compression | Hard to create open class

Obviously seen from the figure, the audibility threshold curve around the 150Hz is strongly distorted, is improved a lot. This means, originally located near the audibility threshold 150Hz above a certain frequency of sound, there is probably a stronger signal in the 150Hz the presence of audible, which was blocked.

Perceptual coding the basic rule is, never need encoding the human ear can not hear the signal, simply put, hear signals do not need to be coded, and this nonsense is one focus of the research on speech compression. Nonsense is very easy to understand correctly the meaning of another word. Closer to home, what can’t hear it? Power is below the audible threshold signal or component, blocked the signal or component, the human ear can not hear, were referred to above “redundancy”.

Some of these are acoustic. To be a good understanding of audio compression, you also need to understand the concept of a more important: subband. Band (subband) refers to a frequency range, when the frequencies of the two tones is when a child band, you will hear two tones. More general case, if the frequency distribution in a complex signal when a child band, human feeling is the frequency of the signal is equivalent to a simple signal at the the band centre frequency, which is the core of the. Simply put, band is a frequency range, frequency signal that falls within this range can be replaced by a single frequency component.

Detailed hotspots--endpoint detection of speech detection, noise reduction and compression | Hard to create open class

General equivalent frequency sub band center frequency, amplitude sub band frequency amplitude-weighted and, a simpler method is to add the amplitude of the frequency components directly, as equivalent to the amplitude of the signal, this range of frequency components in a component, you can replace.

Set the frequency spectrum of a signal minimum value W0, W1 maximum. Sub-band coding is the frequency range between W0-W1 is divided into several sub, then each child within the component using an equivalent frequency component to replace. In this way, a complex spectrum of signals can be equated to a spectrum constitute very simple signal-spectrum were greatly simplified, require very little storage.

From the above procedure is not difficult to know, how to divide a great effect on the quality of compressed audio (are equivalent). Band classification is subband coding of a very important research topic, can be roughly divided into a fixed-width subband coding and variable-width encoding, comprehension, or explanation.

After the child with a number of different compression algorithms of different grades. Easy to know, more low bit-rate compression rate is high, with fewer, and poor sound quality. Opposite is also easy to understand.

Understanding of subband coding, audio compression is very easy to understand, a signal through a triangular filter set (the equivalent of a set of child) after being down to a small number of frequency components. Then visit these frequency components, energy or amplitudes are below audibility threshold curve ignore (deletes the component, because not heard). Reinvestigation on the remaining 22 adjacent frequency component, if one is next to the frequency screen, delete. After the above process, the spectrum of a complex signal contains frequency components is very simple, with very little data can be stored or transmitted information.

When decoded using the inverse Fourier transform to refactor the above the simple spectrum time domain, are decoded voice.

Above is the simple principle of audio compression, let’s talk about audio codec library.

Publicly available audio codec open source a lot, its features and capabilities are different, as shown below:

Detailed hotspots--endpoint detection of speech detection, noise reduction and compression | Hard to create open class

From the figure, you can see, AAC and MP3 is “high-end”, used to encode the music with high sampling rate, AMR and SPEEX is the low end of the road, you can handle the 16K sample rate below the speech signal, speech synthesis, speech recognition, speech recognition and other speech applications is enough.

HKUST fly series of speech using SPEEX, information about the algorithm as shown in the following figure:

Detailed hotspots--endpoint detection of speech detection, noise reduction and compression | Hard to create open class

A wide range of compression transform the Speex codec library, compression level a wide range to choose from, so used in network condition in more complex mobile terminal application is appropriate.

Well, that’s the entire contents of this share class.

Summary:

Audio endpoint detection, noise reduction and voice compression, a lot of people find it mysterious, difficult to understand and difficult to grasp. But the teacher whispered, usually feels big on the voice processing technology, is also easy. Original, not need is advanced of theory Foundation also can understanding these technology of key: audio endpoint detection of key is according to front of mute determine used to tell mute and effective voice of ruler, drop noise of key is using front of a small paragraph background noise extraction out noise of spectrum, audio compression method one of is full using human of heart acoustic, designated molecular with, removal redundant,.

Let us focus on speech processing technology with the latest developments in the above aspects.

(If you interested in the products and technologies of HKUST, fly, you can fly to HKUST’s website to view)

V tor Ba a sound neutral nature 299 Yuan DT235 headphones

DT235 headset is always worshipped the Asian power’s best-selling product, triple-band equalization, natural acoustic performance, suitable for most types of music, if Vítor Baía neutral natural style of voice is standard, black and white can be selected. At present, the products the store price of 299 Yuan.

OtterBox Note 4 Cases

Vítor Baía sound neutral nature 299 Yuan DT235 headphones

Vítor Baía DT235 uses a closed design, ear-muffs with high velvet, in addition to having extremely comfortable to wear, but also can effectively block outside noise, and leave you in peace to enjoy the music.

Vítor Baía sound neutral nature 299 Yuan DT235 headphones

Vítor Baía DT235 is very suitable for Office use. Its impedance is only 32 euros, with wide frequency response. Laptops, tablet computers are fully driven quality at the same price at the forefront. OtterBox

Source: zol OtterBox Note 4 Cases

Really local Nokia 2 301 gold plated

Really local! Nokia 2,301-gold plated

Viet Nam professional gold service company Karalux has launched the gold plated version of the phone, as we all know is certainly 128G 24K gold iPhone 6 and, more recently, to meet the real needs of local Gold Edition Nokia 230: is a function, as long as the gold-plated customer needs and we can help you! Moschino Samsung Galaxy Note 3 case Moschino Samsung Galaxy case

Really local! Nokia 2,301-gold plated
Really local! Nokia 2,301-gold plated

Although features machine specification configuration it will be special care, or simply mention here: 2.8 inch 240 x 320 screen, 16MB RAM,200 rear camera system for the S30, support dual SIM dual standby.

Moschino Samsung Galaxy Note 3 case

Really local! Nokia 2,301-gold plated
Really local! Nokia 2,301-gold plated

But Nokia 230 this machine is not listed, Microsoft’s website shows upcoming. Karalux website sells for $ 200, we have interest to consider starting with a gold-plated machine?

Really local! Nokia 2,301-gold plated

Tips

Really local! Nokia 2,301-gold plated
Really local! Nokia 2,301-gold plated

Micro sweep sweep, author tips bar ~

Low price Wacom MDP 123 Inkling 7765 Yen

Low price: Wacom MDP-123 Inkling 7765 Yen

Wacom is the familiar number plate manufacturers, this MDP-123 Inkling of its launch, without a Tablet, write directly on paper. By ultrasonic and infrared technology, with the receiving device to record marks. Support 1024 pressure-sensitive and other digital data should be saved. And can read and modify data through a computer. FOSSIL iPhone 5

Wacom MDP-123 Inkling now historically low price 7765 yen, about 480 Yuan, domestic water in 900~1000, the price advantage is obvious.

FOSSIL iPhone 5

Purchase address FOSSIL iPhone 5

Samsung Galaxy S7 edge determines with 20 million pixel camera

Samsung Galaxy S7 edge determines with 20 million pixel camera

From Han media to out of this Zhang pictures in the we can see, Galaxy S7 edge in shape aspects and Galaxy S6 edge difference is unlikely to, Galaxy S7 edge still is using surface screen design, its screen of face size for 5.2 inches, and is Shang surface screen part Hou total screen for 5.5 inches, while also has home keyboard of fingerprint recognition, support wireless fast filling, and SoC part also not out we of accident, it will has Exynos 8890/ Xiao long 8,202 version, memory is still LPDDR4 memory, and finally everyone awaited USB Type-C interface, I believe in fast USB transfers will not let you down.

  And, most important of all, Galaxy S7 edge has used 20 million megapixel camera sensor, and for this there is no doubt that Samsung the largest increase, know from Galaxy S5 Samsung started using 16 million pixel camera, slowly from the back with the iPhone series camera distance. Today, the Galaxy S6 models such as Note 5 has been taking pictures of far beyond iPhone 6s Plus, and Galaxy S7 series photographed carrying 20 million pixel level Vans iPhone 6 Case

Samsung Galaxy S7 edge determines with 20 million pixel camera

Tips

Samsung Galaxy S7 edge determines with 20 million pixel camera
Samsung Galaxy S7 edge determines with 20 million pixel camera

Micro sweep sweep, author tips bar ~ Vans cases

Vans iPhone 6 Case